Intelligent Process Supervision Using Reinforcement Learning and Temporal Abstraction
Abstract
Supervisory control usually involves timely switching among different courses of action over multiple time scales. In this work, intelligent process supervision is addressed in the context of semi-Markov decision processes and reinforcement learning. Temporally extended actions that represent a way of behaving together with a termination condition are used to achieve a set of operational goals/sub-goals comprising a supervision task. The control strategy resorts to a hierarchy of macro-actions or options which are made up of closed-loop sequences of low-level, primitive actions. Supervisory control of a buffer tank is discussed as a representative example. Copyright © 2002 IFAC
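The idea of options (a closed-loop policy plus a termination condition) learned with an SMDP update can be illustrated with a small sketch. This is not the paper's buffer-tank model or algorithm: the deterministic 11-level tank, the target band, the three options, the reward, and all parameter values below are hypothetical choices made only to show SMDP Q-learning over options.

```python
import random
from dataclasses import dataclass
from typing import Callable, Dict, Tuple

LEVELS = range(11)   # hypothetical discretised tank levels 0..10
LOW, HIGH = 4, 6     # hypothetical target band for the level


def step(level: int, action: int) -> Tuple[int, float]:
    """One primitive step: action in {-1, 0, +1} shifts the level (toy dynamics)."""
    nxt = min(10, max(0, level + action))
    reward = 0.0 if LOW <= nxt <= HIGH else -1.0
    return nxt, reward


@dataclass(frozen=True)
class Option:
    name: str
    policy: Callable[[int], int]       # closed-loop choice of primitive action
    terminates: Callable[[int], bool]  # termination condition on the new level


OPTIONS = [
    Option("fill", lambda s: +1, lambda s: s >= LOW),
    Option("drain", lambda s: -1, lambda s: s <= HIGH),
    Option("hold", lambda s: 0, lambda s: True),
]


def run_option(level: int, option: Option, gamma: float,
               max_steps: int = 20) -> Tuple[int, float, int]:
    """Execute an option to termination; return (level, discounted reward, duration k)."""
    total, discount, k = 0.0, 1.0, 0
    while True:
        level, r = step(level, option.policy(level))
        total += discount * r
        discount *= gamma
        k += 1
        if option.terminates(level) or k >= max_steps:
            return level, total, k


def train(episodes: int = 3000, gamma: float = 0.95, alpha: float = 0.1,
          epsilon: float = 0.1, seed: int = 0) -> Dict[Tuple[int, str], float]:
    """SMDP Q-learning over options with epsilon-greedy exploration."""
    rng = random.Random(seed)
    q = {(s, o.name): 0.0 for s in LEVELS for o in OPTIONS}
    for _ in range(episodes):
        s = rng.choice(list(LEVELS))
        for _ in range(20):  # option-level decisions per episode
            if rng.random() < epsilon:
                o = rng.choice(OPTIONS)
            else:
                o = max(OPTIONS, key=lambda opt: q[(s, opt.name)])
            s2, r, k = run_option(s, o, gamma)
            best_next = max(q[(s2, opt.name)] for opt in OPTIONS)
            # SMDP update: the bootstrap term is discounted by gamma**k,
            # where k is how many primitive steps the option lasted.
            q[(s, o.name)] += alpha * (r + gamma ** k * best_next - q[(s, o.name)])
            s = s2
    return q


def greedy(q: Dict[Tuple[int, str], float], s: int) -> str:
    """Name of the highest-valued option at level s."""
    return max(OPTIONS, key=lambda o: q[(s, o.name)]).name
```

Calling `q = train()` and then `greedy(q, 0)` or `greedy(q, 10)` shows which option the learned supervisor prefers at an empty or a full tank. The point of the sketch is that the learner only decides when an option terminates, so decisions are made on the slower time scale of the options rather than at every primitive step.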
Related resources
Apprentissage par renforcement pour les processus décisionnels de Markov partiellement observés Apprendre une extension sélective du passé
We present a new algorithm that extends the Reinforcement Learning framework to Partially Observed Markov Decision Processes (POMDP). The main idea of our method is to build a state extension, called exhaustive observable, which allows us to define a new process that is Markovian. We prove that solving this new process, to which classical RL methods can be applied, brings an optimal...
Hierarchical Functional Concepts for Knowledge Transfer among Reinforcement Learning Agents
This article introduces the notions of functional space and concept as a way of knowledge representation and abstraction for Reinforcement Learning agents. These definitions are used as a tool of knowledge transfer among agents. The agents are assumed to be heterogeneous; they have different state spaces but share the same dynamics, reward and action space. In other words, the agents are assumed t...
Towards Intelligent Execution Supervision for Flexible Assembly Systems
Research results concerning error detection and recovery in robotized assembly systems, key components of flexible manufacturing systems, are presented. The approach to the integration of services and the modelling of tasks, resources and environment is described. A planning strategy and domain knowledge for nominal plan execution and for error recovery are presented. A supervision architecture p...
Control of Multivariable Systems Based on Emotional Temporal Difference Learning Controller
One of the most important issues in controlling delayed systems and non-minimum phase systems is to fulfill objective orientations simultaneously and in the best way possible. In this paper, a new objective-orientation method is proposed for controlling multi-objective systems. The method is based on emotional temporal difference learning, and has a...
Emotional Learning Based Intelligent Controller for MIMO Peripheral Milling Process
During the milling process, one of the most important factors in reducing tool life expectancy and workpiece quality is the chattering phenomenon due to self-excitation. The milling process is considered as a MIMO strongly coupled nonlinear plant with time delay terms in cutting forces. We stabilize the plant using two independent Emotional Learning-based Intelligent Controllers (ELIC) in par...
Publication date: 2002